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Abstract 

There exists a bijection between one stack sortable permutations -permutations which avoid 
the pattern 231- and planar trees. We define an edit distance between permutations which is 
coherent with the standard edit distance between trees. This one-to-one correspondence yields 
a polynomial algorithm for the subpermutation problem for (231) avoiding permutations. 

Moreover, we obtain the generating function of the edit distance between ordered trees and 
some special ones. For the general case we show that the mean edit distance between a planar 
tree and all other planar trees is at least n/ln(n). 

Some results can be extended to labeled trees considering colored Dyck paths or equivalently 
colored one stack sortable permutations. 

1 Introduction 

The edit distance between two trees is the minimal number of edit operations to transform one 
tree into the other. The edit operations are deletion (edge contraction), insertion of an edge and 
relabeling of a vertex. 

The main problem is to find efficient algorithms to compute this distance between ordered labeled 
trees. Many algorithms have been proposed ^El- The basic idea of all these dynamic algorithms 
arises from the paper of Zhang and Shasha [Q . Further improvements have been made [2] ■ 

Comparing the structure of molecules and finding the preserved ones during a genetic mutation 
can be seen as an edit distance problem. The application field of this problem is not restricted 
to biology: in computer vision, objects are represented by their skeletons -which are trees-, and in 
computer science, edit distance is used to compare structural similarities between XML documents 

m 

But no combinatorial interpretation has been made of the edit distance between trees. In this 
article, we introduce one-stack sortable permutations |S] . These one-stack sortable permutations 
are (231) pattern- avoiding permutations and we show that they are in one-to-one correspondence 
with ordered trees. 

Moreover the edit operations can be easily described in terms of one-stack sortable permutations. 
This leads to a purely combinatorial explanation of the edit distance. 

Some polynomial algorithms are known to compute the edit distance between trees pp. By our 
correspondence, we show that computing the greatest common pattern between two (231)-avoiding 
permutations is also polynomial whereas it is NP-complete for general permutations ■ 

2 Definitions 

2.1 One-stack sortable permutations 

We describe in this section an encoding for planar trees. We number the edges of the tree by a 
postfix traversal and then read the permutation by a prefix traversal. The obtained permutations 
are called one stack sortable permutations 0J[S]. An alternate definition is the following: 



Definition 1. Let n G N, a one-stack sortable permutation on {l...n} is a permutation a such 
that a = In J where I and J are one-stack sortable permutations on {1 . . .p} and {p + 1 . . . n — 1} 
respectively. Notice that I or J could be empty. 

Note that in the sequel, permutations are seen as words. 

Theorem 1. One-stack sortable permutations are in one-to-one correspondence with rooted ordered 
trees. 

Proof. Given a tree T with n edges, number the edges by a postfix Depth First Search Traversal 
(DFS). Read it again by a prefix DFS. It is clear that the obtained permutation is of the form In J. 
Moreover / corresponds to the encoding by a postfix DFS of the left subtree as shown in Figure ^ 
The same goes for J but its numbers are shifted. 

Conversely, take a one-stack sortable permutation a = In,] . 

• If a = k then the corresponding tree is a single edge. 

• If a = InJ then the corresponding tree T a is the tree obtained by taking an edge e = (xy) 
(corresponding to n) where x is the root of T CT . Since / and J are also one-stack sortable 
permutations, we can recursively build the corresponding trees Tj and Tj. Put them at each 
end of the edge e, ie Tj is hanging on x such e is the rightmost edge of x, and Tj on y. 

This construction is unique. 



If <x is a one-stack sortable permutation, let T(tr) denote the tree associated to a. Conversely, 
if T is a tree, its associated one-stack sortable permutation is denoted by O(T). Moreover, in the 
sequel, o~k will either denote the fc-th letter of the word a or the corresponding edge in T{o~). 

Definition 2. A subsequence of a permutation a = o~i . . ,o~ n is a word a' = o~i 1 . . . o~i k where 
i\, . . . , ife is an increasing sequence of elements of {1, . . . , n}. 

Let $ be the bijective mapping of {er^ , a i2 , . . . , a ik } on {1, ... ,k} preserving the order on . 

The normalized subsequence (pattern) a' is equal to < &(o''). 

Remark 1. The one-stack sortable permutations are the permutations avoiding the normalized 
subsequence (pattern) 231 

2.2 Edit distance 

We briefly recall the definition of the edit distance between trees. Given two trees, the edit distance 
is the minimal number of operations necessary to transform one into the other. The operations are: 

• Deletion : This is the contraction of an edge; two vertices are merged. Only one label is kept. 




x 



Figure 1: Coding a tree with a one-stack sortable permutation. 
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• Insertion : This is the converse operation of deletion. 




Figure 2: Insertion and Deletion operations on a tree. 
A cost can be given to each operation. In this article we take 1 for every cost. 

3 Distance on one-stack sortable permutations 

Since one-stack sortable permutations arc in one-to-one correspondence with planar trees, we define 
similar edit operations between one-stack sortable permutations and show that these definitions 
match with edit distance between trees. Moreover, we give a combinatorial interpretation of the 
distance. 

A factor of a permutation a = o~\o~2 ■ ■ ■ o~ n is a factor of the word <J\o~2 ■ ■ - o~ n i.e. a word of the 
form tTfcO-fc+i . . -o-k+i- 

A factor / is compact if it is a permutation of an interval of N. 

A factor / of a is complete if no non-empty factor g of a verifies both: 

1. fg is compact where fg is the concatenation of the words / and g; 

2. the greatest element of fg is equal to the greatest element of /. 

Take for example the one-stack sortable permutation a — (1524376). The complete factors of a 
arc {1},{15243}, {1524376}, {5243}, {524376},{2},{243},{43}, {3}, {76},{6}. 




(1524376) 

Figure 3: Tree associated to a = (1524376). 

A subtree T 1 of T is a tree such that T \ T' is connected. 
Lemma 1. Each compact factor of a are in one-to-one correspondance with: 

• to a subtree 

• to a internal path P in T = T(o~) where each internal vertex of P is of degre 2 in T and P 
does not end at a leaf (P can be an internal edge). 
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Proof. First let prove that the subset of edges corrcpsonding to a compact factor is connected. 

Let a' be a compact factor of a — 0(T). Let E a > be the set of edges corresponding to a' in T. 
Suppose that E a i is not connected. Let E\ and £2 be two connected components. Let v be the first 
common ancestor of E\ and P 2 . Let Pi (resp. P%) be the path starting from v and ending at the 
first vertex of E\ (resp. E2). Note that we can choose E\ and E2 such that edges of Pi and P2 are 
not in E a i. Suppose that Pi is at the left of P2 (See Figure In the prefix DFS of T, edges of 



Figure 4: Compact factors are connected components. 

P2 are visited between those of E\ and E%. Thus they should appear in a' , hence P2 = 0. Thus 
v € E2 so that Pi links E2 and E-y. In the postfix DFS, the edges of Pi have labels greater than 
those of Pi and less than E2. If P2 ^ 0, it implies that a' is not compact. Thus E a i is connected. 




Figure 5: Subtree of T induced by E G i. 



Consider the subtree T 1 of T induced by E a t . It consists of E a i plus all vertices of T that have 
an ancestor in E a i as shown in Figure |SJ 

E a i can be decomposed into edge-disjoint paths Pi thanks to the prefix DFS (See Figure [SJ). Fi 
is the subtree pending on P, which can be empty. 

The prefix DFS of T 1 (which is a factor of a) gives the associated permutation 0(P L ')0(Pi)0(P^) 
9(P 2 ) . . . e(P£)9(P fe ). So a' = e(P 1 / )6(Pi)e(P 2 ')e(P 2 ) . . . 0(PQ, hence P = 0, Vi < fc. 

• Suppose pt 7^ 0. If fc > 1, then the edges of Pfc are visited after at least one edge of P[, and 
before the edges of P' k in the postfix DFS. Since a' is compact, it implies k = 1. 

• If Pfc = 0, is a subtree. 
The converse is straightforward. 

□ 

Proposition 1. The set of complete factors of a corresponds to the set of subtrees of the associated 
tree. 

Proof. Let T' be a subtree of T and a = O(P). The edges of T' are visited consecutively by the 
postfix (resp. prefix) DFS of T. Thus the sequence of edges of T' is a compact factor crfcUfc+i . . . o~k+i 
of a. Ofc+/+i is an edge which is visited after all edges of T' by the prefix DFS. Thus it is the first time 
this edge is visited by the traversal. Hence, its label is greater than those of T' . Thus crfcOfc+i . . . ak+l 
is complete. 
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Figure 6: Insertion operations for / = (43). 



Conversely, let a' be a complete factor. As a' is compact, by Lemma^ it corresponds either to 
a subtree or to an internal path P with a subtree F hanging on P. Q(P)Q(F) = a'Q(F) is also a 
compact factor of a and it has the same maximum as a' which contradicts the completeness of a' . 

□ 

Remark 2. Let a be a one-stack sortable permutation and o~ k = (p(v k )v k ) an edge where p(vk) 
denote the parent of v k . Let a' be the shortest complete factor of a such that a' = a k a k +\ ■ ■ -o~k+i 
where o~i = (p(vi)vi). By previous proposition T{a') is a subtree ofT(o~). The children of v k are the 
vertices v k+t such that i < I and a k > a k+i > a k+ i,a k+2 , ■ ■ ■ , o~k+%-i ■ 

Let cr = cti . . . <7fc be a word of {1 . . . n} and a be a letter of {1 ... n}. We denote by [<r] a the 
word a[ . . . a' k where 

, _ \ <Ji if Ui < a 

1 o~i + 1 otherwise 

Definition 3. We define two operations on permutations which map the standard definition on trees 
(US): 

1. Deletion : Let 1 < k < n. The deletion (o~ k — > A) is the removal of a k in a permutation a and 
the renormalization on SVi-i of the result. We will either talk about the deletion of the edge 
a k or the deletion of the vertex v such that a k is the edge p(v)v. 

2. Insertion (see Figure \Q) : (A — * 0) corresponds to the transformation of the permutation 
cr = into a' = (1). If a =/: 0, let f be a complete factor of a. Then, a = ufv with u,v 
factors of a. 

(a) (A — > /): The resulting permutation is a' = \u\ a af\v\ a , & = max{f}+l. This corresponds 
to the insertion of an inner vertex with T(/) as subtree. 

(b) (A /) : The resulting permutation is a' = [u] a fa[v] a , a = max{f} + 1. This corre- 
sponds to the insertion of a leaf as the right sibling ofT(f). 

(c) (A — > f) : The resulting permutation is a' = [u] a a[f] a [v] a , a = min{f}. This corresponds 
to the insertion of a leaf as the left sibling ofT(f). 

We study now these operations on the permutation a = (1524376). 
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Figure 7: Insertion in permutation a = 1524376. 
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The array of Figure \7\ gives all the permutations that can be obtained with a single insertion in 

a. 

We prove now that the operations (deletion and insertion) defined on one-stack sortable permu- 
tations are in fact internal operators for one-stack sortable permutations. Moreover, these operators 
define an edit distance between permutations coherent with the usual edit distance between trees. 

Lemma 2. The Deletion /Insertion algorithm yields a one-stack sortable permutation. 

Proof. • Deletion : The proof is straightforward considering the one-to-one correspondence with 
trees and one-stack sortable permutations. Consider a tree labeled by a depth first traversal. 
Deleting the edge i from this tree changes all labels greater than i by subtracting 1. 

• Insertion : Let a be a one-stack sortable permutation and / be a complete factor of a = ufv. 
By Proposition ^ / corresponds to a subtree of T(a). 

1. (A — ► /): Let T = T(a) and (ei, e 2 , . . . , e„) be the edges of T ordered by a prefix DFS 
of the tree. Note that a = aT(ei)ar(e2) • • • ar(e n ) where a(i) is the label of the edge i 
in T. 

Let T' be the tree obtained by the insertion of an internal vertex v (a = (p(v)v)) at the 
root vertex of the subtree T(f). Moreover T(/) is a subtree hanging on v. Let a" = 
8(T'). A prefix traversal of T' orders the edges of T" as follows: (ei, e 2 , . . . , e/, a, ej+i, . . . , e„). 
Since a" is obtained by a prefix traversal, a" = u'af'v'. Since the edges of / appear 
before a in the postfix DFS, /' = /. The edge a in a postfix DFS appears just after /. 
Thus its label is max{f} + 1. All the edges visited after / in T (and so after a in T") by 
the postfix DFS have their labels increased by 1. Thus a" = [u] a/[i)] a = a' . 

2. (A — ► /), (A — > /) : The same arguments as for (A — -> /) hold. 

□ 

Proposition 2. Insertion and deletion are inverse operations. 
Proof. There are two different kinds of deletions in a tree T. 

1. Deletion of an inner vertex v. Consider the subtree T of T hanging on v. It corresponds to 
a complete factor / in a = 0(T). This contraction corresponds to the inverse operation of 
(A-/). 

2. Deletion of a leaf. There are three different cases: 

• Deletion of a vertex with no sibling. This is the same as deleting the parent of this vertex 
which is an inner vertex except if the tree is reduced to a single edge. 

• Otherwise, this vertex has either: 

— A left sibling v'. Consider the subtree hanging at v' (including p(v')v'). It corre- 
sponds to the factor /. The inverse operation is (A A /) 

— A right sibling v' . Consider the subtree hanging at v' (including p(v')v'). It corre- 
sponds to the factor /. The inverse operation is (A — > /) 

□ 

Definition 4. The distance between two one-stack sortable permutations o\ and o~2 is the minimal 
number of operations -deletion or insertion - to transform o~i into o~2- 

For example let o\ = 31264587 and a% = 1524376. We want to transform u\ into u%. 
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• 31264587 (1 ^ A) > 2153476 

• 2153476 (1 " A) > 142365 

• 142365 (A ^ 3) > 1524376 

Theorem 2. The edit distance between ordered trees is the distance between the associated one-stack 
sortable permutations. 

Proof. This is a consequence of Proposition □ 
Theorem 3. The edit distance between one-stack sortable permutations o\ and 02 is equal to 

|<ri| + |o- a | — 2|u| 

where u is a largest normalized subsequence (pattern) of o~\ and o~i- 

Proof. The edit distance d(o~i, 0-2) between o~\ and o~2 is given by the minimal number of insertions 
and deletions. If t\ is an insertion and t% is a deletion then there exist a deletion t\ and an insertion 
t'2 such that t\t 2 (a) = t' 1 t , 2 (o'). Note that t[ and t' 2 depend on the one-stack sortable permutation 
a. 

Considering the sequence of edit operations, there exists a sequence made of deletions then 
insertions that transforms u\ into <j 2 . We denote this sequence by D\ . . . DiO± . . .Ok, I + k = 
d(o- 1 ,a 2 ). 

Consider the one-stack sortable permutation a' = D\ . . . Di{a{). Take u — a' . u is a normalized 
subsequence of o\ because deleting an edge from a one-stack sortable permutation yields a normal- 
ized subsequence of the original one-stack sortable permutation, u is also a normalized subsequence 
of a 2 because inserting an edge in a one-stack sortable permutation s yields a one-stack sortable 
permutation s' and s is a normalized subsequence of s' . 

Conversely, take u as a maximal normalized subsequence of o~\ and a 2 . It is straightforward to 
find \o~\ \ — \u\ operations of deletions such that those deletions transform <j\ into u. The same goes 
for (7 2 and u. 

□ 

Corollary 1. Finding the greatest common pattern between two one-stack sortable permutations is 
polynomial. 

In [H], they proved that finding the greatest common pattern between two permutations is NP- 
complete. We prove here that the problem becomes polynomial when restricting to one-stack sortable 
permutations, ie (132) or (231)-avoiding permutations. In fact, the algorithm of Zhang and Shasha 

on trees solves the problem on one-stack sortable permutations because the algorithm outputs 
not only the distance but also the greatest common subtree. 




4 Lower bounds on average edit distance 

In this section we study the average edit distance between a given planar tree T with n vertices and 
all other planar trees with n vertices. We show that this average distance is lower bounded by 

Lemma 3. Let T be a planar tree with n vertices. There are at most n — 1 different deletions and 
3n 3 insertions allowed in T. 
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Proof. The number of deletions is upper bounded by the number of edges i.e.n — 1. 

The number of insertions is bounded by 3 times the number of subtrees (or complete factor of 
the corresponding permutation). The number of subtrees of T rooted at vertex v is bounded by 
d(v) 2 where d(v) denotes the degree of vertex v. Thus the total number of subtrees is bounded by 
E v d{v) 2 . □ 

Theorem 4. Let Tq be a tree with n vertices. The proportion of planar trees with n vertices at 
distance at most 0(n/ln(n)) tends to 0. 

The average distance between To and the set of planar trees is lower bounded by n/ln{n). 



Proof. Let T be a planar tree. Let A k = {T e T n , dist(T , T) < 2k}. Note that A = {T }. A tree 
Tk € A k is obtained from T by I < k deletions then I insertions. Thus \A k \ < (n — l) k (n 3 ) k < n ik . 
But the number of planar trees C n = n ^ nn ■ So that the proportion of planar trees at distance at 
most 0(n/ln(n)) tends to 0. 

Hence the average distance is lower bounded by n/ln{n). □ 



5 Generating functions 

Using the combinatorial interpretation of the distance, we compute the generating functions of 
the edit distance between planar trees with n edges and some special ones as shown in Figure |S1 
Moreover, we deduce the average distances from the generating functions. 



L-"V 3 4" 
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Figure 8: Some canonical trees. 



5.1 Generating function of the edit distance between one-stack sortable 
permutations and Id = 1 2 ... n 

We denote by Si(t, q) the generating function of one-stack sortable permutations where t counts the 
size of the permutation and q the edit distance between one-stack sortable permutations and Id. 
This is the distance between a tree and the trivial one which is made of n edges and of height 1 . 

Tree interpretation of the largest increasing subsequence 

Proposition 3. The length of a largest increasing subsequence of a one-stack sortable permutation 
is the number of leaves of the associated tree. 

Proof. Let T be a planar rooted tree and a the associated one-stack sortable permutation. We call 
a leaf-edge an edge incident to a leaf. 

1. The subsequence of a made of the leaf-edges is increasing because the order in which the 
leaf-edges are visited by a prefix traversal is the same than by a postfix traversal. 
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2. Suppose that we take an increasing subsequence a' of a. This subsequence is in one-to-one 
correspondence with some edges in the tree. Suppose that there is an internal one 7 = {p(v)v). 
Then, by the postordering of the edges, each edge (p(v)v) such that v = p(v) has a smaller 
label and appears in a after the edge 7. Thus, none of these edges are in a' . Moreover, there 
is at least one leaf edge belonging to the subtree T 7 hanging on v. Replace edge 7 by a leaf of 
Try. The prefix traversal ensures that the obtained subsequence is an increasing one. 

□ 

Proposition 4. The number of rooted planar trees with n edges and k leaves is equal to the number 
of rooted planar trees with n edges and n + 1 — k leaves. 

Proof. This is a direct consequence of the symmetry of the Narayana numbers ^(tjft^i) which 
count the number of planar trees with n edges and k leaves. 

□ 

Generating function We now compute the generating function I(t,p) of one-stack sortable per- 
mutations of size t and largest increasing subsequence of size p. 

. [I(t,p)] = 1 

• [/(t,jOh=p 

• [I(t,p)] 2 = (p + p 2 ) 



n-2 



[i(t,p)} n =p[i(t,p)] n - 1 +j2[i(t,p)} i [i(t,p)} n - 1 - i (i) 

i=0 

This formula comes from the decomposition of a one-stack sortable permutation a into InJ with 
n > 1. The largest increasing subsequence of a is the union of the largest one of / and the largest 
one of J unless J is empty - in this case, the largest subsequence is the largest one for In -. 
From this formula we deduce: 

I(t,p) = 1 + (p - l)tl(t,p) + tl 2 (t,p) (2) 

• 1 comes from the case n = in the equation JIJ. 

• ptl(t,p) comes from p[I(t,p)] n —i . 
It follows from equation 

f(f>p) = l+OzrtL^EWEl^+WI ( 3 ) 

Let Si (t, q) be the generating function of the difference between the lengths of the one-stack sortable 
permutation and the largest increasing subsequence in it. 

. [S 1 (t,q)] = -l 

. [Si(t,g)] a =0 

• [Si(t,q)h = q 

Lemma 4. 

I(t,p) = l+p + pS 1 (t,p) (4) 
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Proof. 

T 

= EE^-p)U-^ T+H+1 

r>l /3=1 

T 

T>1 = 
= l+p(&(t,p) + l) 

The end of the proof is straightforward using Proposition □ 
Theorem 5. 

S 1 {t,q) = S^q 2 ) 

i + ( q 2 - i)t - ^/(g 2 - \)H 2 - 2( g 2 + iyri 

2tq 2 

5.1.1 Average distance 

Theorem 6. The average edit distance between rooted planar trees with n edges and Id is n — 1. 

Proof. 1. The average distance 5 can be obtained from the generating function S\(t,q) in the 
following way: 



dq 
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• S = ^ff)" where C{n) is the n-th Catalan number. 

This easy computation yields 8 = n — 1 but a direct combinatorial interpretation proves this 
result in a more comprehensive way. 

2. This is a direct consequence of Propositions [3] and 0] Another proof can be found in [HJ ■ In 
|5] the result is more general. Thus we provide here a simpler proof for this special case. 

□ 

5.2 Generating function of the edit distance between one-stack sortable 
permutations and n{n — 1) ... 1 

This is the distance between a tree and the trivial one which is made of n edges and is of height n. 
It is equivalent to finding the largest decreasing subsequence in the one-stack sortable permutation. 

We compute the generating function D(x,y, z) of trees with respect to the number of edges x, 
the height of the tree y and the number of leaves z at maximal depth. 



Proposition 5. 



1 xyz 

D{x,y,z) = yD{x,y,- ) - yD(x, y, 1) + (5) 

1 — xz 1 — xz 



Proof. 



i=i \ ' 



li 



The coefficient [D(x, y, is equal to the number of ways to add k leaves at depth j to any tree 

with i — k edges, depth j — 1 and I leaves at depth j — 1. ( l -1 ) is the number of ways to add k 
leaves to I leaves at depth j. 



D(x,y,z) = £££di,.j,fcan/ J 2 



i>l j>l k>l 

£ £ £ £ { + l k ~ 1 ) di-kj-uxVz* + y 

'>1 i>2 fe>l i>l ^ ' i>l 

E E E E(-!) fc *-w-m«v^ + » £(-r 

i>i j>2 fe>i ;>i ^ ' i>i 

E E E Et- 1 )* ( fc *j-i,^VM fc + y £ w 

i>l j>2 fc>l ;>i ^ ' i>l 



Using 



fc=0 



k 



x k aT n ~ k 



D{x, y ,z) = EEE^-^r'-^-^v+i/EW 

i>l j>2 Z>1 i>l 

1 xyz 
= yD{x,y,- ) -yD(x,y, 1) + 



1 — IZ 1 — xz 



□ 



Let S2(x,y) be the generating function with respect to the length n of the one-stack sortable 
permutation and the edit distance between this one-stack sortable permutation and n(n — l)(n — 



2)...l. Then, S 2 (x, y) = D(xy 2 , 1) 



2 1 

' V s 

In ^l^J, they give a solution for D(x,y, 1) in terms of a continued fraction. 

1 



D(x,y,l) =Y,D k {y)x k ,D k {y) 



k 



y 

l - 



i- y 



i 



This yields the solution for S 2 - 

s 2 (x, y ) = Y,y 2kD ^> 

The first terms of S 2 are given by: 

S 2 (x, y) = x + x 2 y 2 + x 2 + x 3 y 4 + 3 x 3 y 2 + x 3 + x 4 y 6 + 7 x 4 y 4 + 5 x 4 y 2 + x 4 
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Average edit distance In JOjj they determine analytically the average height of a planar tree 
with n edges which is ^Jun — |. Thus, the average edit distance is 2(n — •Jim + |) = 2n. 

6 Conclusion 

In section 2.2, we define the edit operations to be insertion and deletion. Indeed we omitted a third 
one, the relabeling operation. Instead of working with unlabeled trees, we study trees whose vertices 
arc labeled and the relabeling operation consists in changing the label of a vertex. 

The general case where the trees are labeled and the different edit operations have different 
costs can be obtained in a similar way. Define a decorated one-stack sortable permutation as a 
one-stack sortable permutation where each number is indexed by a letter; l e 5 a 2 a 4&3d7t,6 c represents 
the following tree: 
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The operations on decorated one-stack sortable permutations are almost the same as before and 
the relabeling operation consists in changing one letter. Ci,Cd,c r are respectively the insert, delete 
and relabeling unitary costs. There exists only a difference for the insertion of a new free edge. In 
the unlabeled case, we did not take into account the insertion of a leaf with no sibling. Thus we 
define a fourth insertion operation as: 

• (A — > i) where i is a complete factor of size 1 of the permutation a = uiv. a' = [u] a [i] a a[v] a 
where a = i. 

Let <j\ and oi be two decorated one-stack sortable permutations with the same underlying 
permutation. The label distance ^(01,02) is equal to the string distance between both labeled 
words. 
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Let T\ and T2 be two decorated one-stack sortablc permutations. We denote by a subpermutation 
a of T\ and T2 a normalized subpermutation without label. £7^ is the set of all sub-decorated one- 
stack sortable permutations of T\ which underlying permutation is a. 

The relabeling distance between T\ and T2 with respect to a is: 

d .(T 1 ,T 2 ) = min{c r d(a, 0), Va G S^ l; /3 G £t 2 } 

The distance between these two decorated one-stack sortable permutations T% and T2 is given 
by min{ci(\Ti\ — |er|) + Cd(|T2| — \a\) + da-(Ti, T%), a normalized subpermutation of Ti,T?} 
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